Skip to content

Pull requests: open-compass/opencompass

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add PHYBench
#2125 opened May 27, 2025 by suencgo Loading…
3 of 6 tasks
[Update] Fix the None of self.config in run.py
#2121 opened May 26, 2025 by Zhaoyi-Yan Loading…
6 tasks
Add GrandPhysics dataset
#2118 opened May 26, 2025 by Xiao-Youth Loading…
1 of 6 tasks
Update matbench testing
#2116 opened May 23, 2025 by smgjch Loading…
6 tasks done
More stable MBPP evaluation
#2111 opened May 21, 2025 by f14-bertolotti Loading…
[RULER] Extend 256k and 512k data generators
#2109 opened May 21, 2025 by changlan Loading…
6 tasks done
SRbench
#2105 opened May 20, 2025 by soki123 Loading…
update earth silver benchmark
#2104 opened May 18, 2025 by Zhouzone Loading…
4 of 6 tasks
healthbench
#2099 opened May 15, 2025 by bio-mlhui Loading…
1 task
[Dataset] Add R-Bench (ICML 2025)
#2091 opened May 11, 2025 by uyzhang Loading…
3 of 6 tasks
[Update] Enhancements and Fixes in NeedlebenchV2
#2090 opened May 9, 2025 by Mor-Li Loading…
BaseInferencer batch_size and max_seq_len cast to int
#2074 opened May 5, 2025 by f14-bertolotti Loading…
6 tasks
PromptCBLUE:Life Science dataset
#2073 opened May 4, 2025 by tchenglv520 Loading…
6 tasks done
Phybench
#2069 opened Apr 30, 2025 by epsilondylan Loading…
2 tasks
fix llm judge evaluator import and docs
#2057 opened Apr 27, 2025 by smgjch Loading…
[Dataset]Add GAIA Datasets
#2051 opened Apr 26, 2025 by domonic18 Loading…
1 of 6 tasks
[Feature] Support AntFinix LLM
#2043 opened Apr 24, 2025 by xsq2060 Loading…
replace the model name for new version of bailing
#2034 opened Apr 23, 2025 by cuauty Loading…
6 tasks done
[Dataset] Add SeedBench Dataset
#2020 opened Apr 14, 2025 by ChenZiHong-Gavin Loading…
5 tasks done
[Update] Code related benchmarks update
#2005 opened Apr 6, 2025 by Zhudongsheng75 Loading…
[Fix] Fix default torch dtype loading
#1969 opened Mar 25, 2025 by liushz Loading…
6 tasks
[Model] Add new model: Ola
#1912 opened Mar 4, 2025 by bobo0810 Loading…
4 of 6 tasks
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.